Applying Data Mining Techniques to Extract Hidden Patterns about Breast Cancer Survival in an Iranian Cohort Study

Authors

Hamid Reza Khalkhali

Hadi Lotfnezhad Afshar

Omid Esnaashari

Nasrollah Jabbari

Abstract

Background: Breast cancer survival has been analyzed with many standard data mining algorithms, a group of which belong to the decision tree category. The ability of decision tree algorithms to visualize and formulate hidden patterns among study variables was the main reason for applying an algorithm from this category in the current study, an approach that had not been studied previously. Methods: The classification and regression trees (CART) algorithm was applied to a breast cancer database containing information on 569 patients from 2007-2010. Gini impurity, the measure used for categorical target variables, was used. The classification error, which is a function of tree size, was measured by 10-fold cross-validation experiments. The performance of the resulting model was evaluated by accuracy, sensitivity, and specificity. Results: The CART model produced a decision tree with 17 nodes, 9 of which were associated with a set of rules. The rules were clinically meaningful. Expressed in if-then format, they showed that stage was the most important variable for predicting breast cancer survival. Accuracy, sensitivity, and specificity were 80.3%, 93.5%, and 53%, respectively. Conclusions: The current study's model, the first one created with CART, was able to extract useful hidden rules from a relatively small dataset.
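To make the quantities named in the abstract concrete, here is a minimal Python sketch of Gini impurity for a categorical target, the size-weighted impurity of a binary split, and the accuracy, sensitivity, and specificity computed from confusion-matrix counts. It is an illustration only, not the authors' implementation: the function names, the toy labels, and the counts are all hypothetical, and treating one outcome as the "positive" class is an assumption, since the abstract does not say which outcome the study treated as positive.

```python
from collections import Counter


def gini_impurity(labels):
    """Gini impurity of a list of class labels: 1 - sum(p_k ** 2)."""
    n = len(labels)
    if n == 0:
        return 0.0
    counts = Counter(labels)
    return 1.0 - sum((c / n) ** 2 for c in counts.values())


def split_impurity(left, right):
    """Size-weighted Gini impurity of a binary split (lower is better)."""
    n = len(left) + len(right)
    return (len(left) / n) * gini_impurity(left) + (len(right) / n) * gini_impurity(right)


def binary_metrics(tp, fn, tn, fp):
    """Return (accuracy, sensitivity, specificity) from confusion-matrix counts."""
    accuracy = (tp + tn) / (tp + fn + tn + fp)
    sensitivity = tp / (tp + fn)  # true positive rate
    specificity = tn / (tn + fp)  # true negative rate
    return accuracy, sensitivity, specificity


# Hypothetical toy node with 6 "alive" and 4 "dead" cases (not the study's data).
parent = ["alive"] * 6 + ["dead"] * 4
# Hypothetical candidate split of that node into two children.
left = ["alive"] * 5 + ["dead"] * 1
right = ["alive"] * 1 + ["dead"] * 3

print("parent impurity:", gini_impurity(parent))
print("split impurity:", split_impurity(left, right))
# Hypothetical confusion-matrix counts, chosen only for illustration.
print("accuracy, sensitivity, specificity:", binary_metrics(tp=45, fn=5, tn=30, fp=20))
```

A CART-style tree prefers the candidate split whose weighted impurity falls furthest below the parent node's impurity. In the toy example the split lowers the impurity, so it would be a reasonable candidate.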

Similar resources

The Clustering and Classification Data Mining Techniques in Insurance Fraud Detection: The Case of Iranian Car Insurance

Given the ever-increasing spread of fraud in the insurance sector, especially in automobile insurance, and its negative consequences for insurance companies, applying suitable and efficient methods to identify and detect fraud in this area is essential. Understanding the patterns in data on previously reported claims can help determine whether a damage claim is genuine. One of the most common and widely used ways of discovering data patterns is the use of ...

Using data mining techniques for predicting the survival rate of breast cancer patients: a review article

This review was conducted between December 2018 and March 2019 at Isfahan University of Medical Sciences. A review of various studies revealed what data mining techniques to predict the probability of survival, what risk factors for these predictions, what criteria for evaluating data mining techniques, and finally what data sources for it have been used to predict the surv...

Extracting the Hidden Patterns Affecting Mental Health through Data Mining Techniques

Background and Objective: This study was conducted to shed light on the hidden relationships, trends, and patterns of the teenagers’ mental health dataset based on data mining techniques. Materials and Methods: The proposed method has four parts as follows: data preprocessing, data cleaning, target class selection, and extracting rules. The classes included inappropriate, moderate, and accepta...

Detection of Breast Cancer Progress Using Adaptive Nero Fuzzy Inference System and Data Mining Techniques

Prediction, diagnosis, recovery, and recurrence of breast cancer among patients are always among the most important challenges for researchers and scientists. Nowadays, with the help of bioinformatics, these challenges can be addressed using information from patients' previous records. In this paper, an adaptive neuro-fuzzy inference system and data mining techniqu...

Applying Data Mining Techniques to Medical Databases

The data mining techniques such as Neural Network, Naïve Bayes, and Association rules are at present not well explored on medical databases. In this paper, we present and analyze our experimental results on thrombosis medical database by employing data mining tool of XLMiner and using different data mining techniques such as Naive Bayes and Neural Network for classification, Association rules, ...

Does ethnicity affect survival following colorectal cancer? A prospective, cohort study using Iranian cancer registry

Background: The present study compared the differences between survivals of patients with colorectal cancer according to their ethnicity adjusted for other predictors of survival. Methods: In this prospective cohort study patients were followed up from definite diagnosis of colorectal cancer to death. Totally, 2431 person-year follow-ups were undertaken for 1127 colorectal cancer patients on...

Journal:
Journal of Research in Health Sciences

Volume 16, Issue 1, pages 31-0
